Using Jitter and Shimmer in speaker verification
نویسندگان
چکیده
Jitter and shimmer are measures of the fundamental frequency and amplitude cycle-to-cycle variations, respectively. Both features have been largely used for the description of pathological voices, and since they characterise some aspects concerning particular voices, they are expected to have a certain degree of speaker specificity. In the current work, jitter and shimmer are successfully used in a speaker verification experiment. Moreover, both measures are combined with spectral and prosodic features using several types of normalisation and fusion techniques in order to obtain better verification results. The overall speaker verification system is also improved by using histogram equalisation as a normalisation technique previous to fusing the features by support vector machines.
منابع مشابه
Jitter and shimmer measurements for speaker recognition
Jitter and shimmer are measures of the cycle-to-cycle variations of fundamental frequency and amplitude, respectively, which have been largely used for the description of pathological voice quality. Since they characterise some aspects concerning particular voices, it is a priori expected to find differences in the values of jitter and shimmer among speakers. In this paper, several types of jit...
متن کاملUsing voice-quality measurements with prosodic and spectral features for speaker diarization
Jitter and shimmer voice-quality measurements have been successfully used to detect voice pathologies and classify different speaking styles. In this paper, we investigate the usefulness of jitter and shimmer voice measurements in the framework of the speaker diarization task. The combination of jitter and shimmer voice-quality features with the long-term prosodic and shortterm spectral feature...
متن کاملVocal Parameters of Adults with Down Syndrome in Zahedan /Iran
Background & Aims: Previous studies have indicated significant differences in vocal parameters between children with Down syndrome and normal children, but there are challenges about these differences. In this study vocal parameters and Maximum Phonation Time (MPT) in adults with Down syndrome have been investigated. Method: This cross-sectional and analytic study was performed on 22 adults wit...
متن کاملAre Jitter and Shimmer comparable to perceptual voice analysis in healthy voices?
Introduction: Objective and perceptual acoustic voice assessments are recommended as complimentary parts for clinical voice examinations [1]. However, in hoarse voices there have been contradictory descriptions of the correlation between perceptual and objective acoustic analysis [2, 3]. One reason might be the low reliability of objective acoustic measurements in irregular voice signals, which...
متن کاملAutomatic speaker recognition as a measurement of voice imitation and conversion
Voices can be deliberately disguised by means of human imitation or voice conversion. The question arises to what extent they can be modified by using either method. In the current paper, a set of speaker identification experiments are conducted; first, analysing some prosodic features extracted from voices of professional impersonators attempting to mimic a target voice and, second, using both...
متن کامل